Search CORE

10 research outputs found

Advances in Feature Selection with Mutual Information

Author: A. Kraskov
C. Borggaard
C. Krier
D. François
D. François
D. Scott
F. Rossi
L.F. Kozachenko
M.N. Goria
T. Cover
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2009
Field of study

The selection of features that are relevant for a prediction or classification problem is an important problem in many domains involving high-dimensional data. Selecting features helps fighting the curse of dimensionality, improving the performances of prediction or classification methods, and interpreting the application. In a nonlinear context, the mutual information is widely used as relevance criterion for features and sets of features. Nevertheless, it suffers from at least three major limitations: mutual information estimators depend on smoothing parameters, there is no theoretically justified stopping criterion in the feature selection greedy procedure, and the estimation itself suffers from the curse of dimensionality. This chapter shows how to deal with these problems. The two first ones are addressed by using resampling techniques that provide a statistical basis to select the estimator parameters and to stop the search procedure. The third one is addressed by modifying the mutual information criterion into a measure of how features are complementary (and not only informative) for the problem at hand

arXiv.org e-Print Archive

Crossref

HAL Descartes

DIAL UCLouvain

A computationally efficient estimator for mutual information

Author: Beirlant J
Kozachenko L.F
Publication venue: 'The Royal Society'
Publication date
Field of study

Crossref

A Non-parametric Maximum Entropy Clustering

Author: A.C. Müller
A.K. Jain
E. Gokcay
G.J. McLachlan
H. Hino
L.F. Kozachenko
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2014
Field of study

Crossref

High-Dimensional Entropy Estimation for Finite Accuracy Data: R-NN Entropy Estimator

Author: A.O. Hero
D.B. Russakoff
D.W. Scott
F. Maes
F.P. Preparata
G.A. Darbellay
H. Neemuchwala
H. Singh
J. Beirlant
J. Pluim
L.F. Kozachenko
M.R. Sabuncu
P. Viola
R. Sedgewick
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2007
Field of study

Crossref

Mutual Information Estimation in Higher Dimensions: A Speed-Up of a k-Nearest Neighbor Based Estimator

Author: A. Hyvaerinnen
A. Kraskov
C. Shannon
J. Beirlant
J. Theiler
J.H. Freidman
J.L. Bentley
L. Paninski
L.F. Kozachenko
M.M. Hulle Van
M.N. Goria
N. Kwak
P. Grassberger
S. Bingham
T. Schreiber
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2007
Field of study

Crossref

Scalable Indexing of HD Video

Author: C. Saraceno
D.G. Lowe
F. Chevalier
G.R. Terrell
H.A. Sturges
I. Ahmad
J. Benois-Pineau
K. Fukunaga
K. Fukunaga
L.F. Kozachenko
M. Do
M. Goria
N. Leonenko
P. Piro
P. Salembier
S. Jehan-Besson
S. Mallat
Y. Liu
Y. Liu
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2010
Field of study

Crossref

Fast parallel estimation of high dimensional information theoretical quantities with low dimensional random projection ensembles

Author: A. Hero
B. Póczos
B. Póczos
C. Jutten
C. Shannon
E.G. Learned-Miller
F.J. Theis
F.R. Bach
H. Neemuchwala
J. Kybic
J. Matoušek
L.F. Kozachenko
R.I. Arriga
R.Y. Rubinstein
S. Arya
S. Gaito
W. Johnson
Z. Szabó
Z. Szabó
Publication venue: Springer-Verlag
Publication date: 01/01/2009
Field of study

Abstract. The estimation of relevant information theoretical quantities, such as entropy, mutual information, and various divergences is computationally expensive in high dimensions. However, for this task, one may apply pairwise Euclidean distances of sample points, which suits random projection (RP) based low dimensional embeddings. The Johnson-Lindenstrauss (JL) lemma gives theoretical bound on the dimension of the low dimensional embedding. We adapt the RP technique for the estimation of information theoretical quantities. Intriguingly, we find that embeddings into extremely small dimensions, far below the bounds of the JL lemma, provide satisfactory estimates for the original task. We illustrate this in the Independent Subspace Analysis (ISA) task; we combine RP dimension reduction with a simple ensemble method. We gain considerable speed-up with the potential of real-time parallel estimation of high dimensional information theoretical quantities. Key words: independent subspace analysis, random projection, pairwise distances, information theoretical estimations

CiteSeerX

Crossref

UCL Discovery

A Note on Bayesian Inference for Long-Range Dependence of a Stationary Two-State Process

Author: A. Chronopoulou
A. Grelaud
D. Heath
D. Kwiatkowski
G. Samorodnitsky
G.A. Churchill
H. Hurst
H.V. Singh
J. Beirlant
J. Beran
J. Beran
J. Beran
J.-M. Bardet
J.K. Pritchard
K.M. Abadir
L. Giraitis
L.F. Kozachenko
M.S. Taqqu
P. Abry
P.M. Robinson
T.M. Cover
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Application of Mutual Information Methods in Time–Distance Helioseismology

Author: A. Kraskov
A.C. Barato
A.C. Barato
Alexei A. Pevtsov
C. Shannon
D.-Y. Chou
Dustin Keys
E.T. Jaynes
J.D. Victor
J.M. Horowitz
K.-R. Chen
L.F. Kozachenko
O. Burtseva
P.H. Scherrer
S. Kholikov
S. Kholikov
S. Kholikov
Shukur Kholikov
T. Sagawa
T.L. Duvall Jr.
T.M. Cover
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Efficient Estimation of Information Transfer

Author: A. Kaiser
A.M. Fraser
B. Gourevitch
B. Pompe
B.A. Olshausen
C.E. Shannon
C.E. Shannon
C.J. Cellucci
C.M. Gray
C.O. Daub
D.H. Hubel
D.H. Johnson
D.W. Hahs
E. Pereda
G. Niso
G.A. Darbellay
G.A. Miller
J. Theiler
J. Victor
J.D. Victor
J.M. Nichols
K. Hlavácková-Schindler
K. Hlaváčková-Schindler
L. Barnett
L. Barnett
L. Barnett
L. Faes
L.F. Kozachenko
L.Y. Cao
M. Chávez
M. Lindner
M. Paluš
M. Ragwitz
M. Vejmelka
M. Wibral
M. Young-Il
M.S. Lewicki
N. Ay
N. Wiener
N. Wiener
P. Merkwirth
P. Zezula
P.C.W. Davies
P.E. Latham
P.M. Vaidya
R. Steuer
R. Vicente
R. Vicente
R.R. Ruyter van Steveninck de
R.T. Canolty
S.H. Nirenberg
T.. Schreiber
T.M. Cover
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2014
Field of study

Crossref